PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Pp3c12_21500V3.3.p
Common NamePHYPADRAFT_233852
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Bryophyta; Bryophytina; Bryopsida; Funariidae; Funariales; Funariaceae; Physcomitrella
Family MYB
Protein Properties Length: 2211aa    MW: 235575 Da    PI: 5.4326
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Pp3c12_21500V3.3.pgenomeCOSMOSSView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding25.92.3e-08767810347
                         SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
     Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47 
                          WT  E el+ ++v+++G++ +++Ia ++g +++  qck ++ k 
  Pp3c12_21500V3.3.p 767 QWTDRERELFTEGVRLFGKD-FERIAVHVGSTKSVGQCKAFFCKT 810
                         5*****************99.*********99********99886 PP

2Myb_DNA-binding34.16.1e-1112921333346
                          SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
     Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46  
                          +WT+eE e+++d ++ +G++ W++  ++++  ++l q+k ++q+
  Pp3c12_21500V3.3.p 1292 SWTQEEKEKFADIIRNHGKD-WTRLHECLP-SKSLTQIKTYFQN 1333
                          7*****************99.*********.************9 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466896.46E-13504567IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.7E-4513560IPR009057Homeodomain-like
PROSITE profilePS5129317.276516567IPR017884SANT domain
SMARTSM007172.1E-7517565IPR001005SANT/Myb domain
PROSITE profilePS5129317.617763815IPR017884SANT domain
SMARTSM007171.1E-8764813IPR001005SANT/Myb domain
PfamPF002497.8E-7767809IPR001005SANT/Myb domain
SuperFamilySSF466892.09E-9767817IPR009057Homeodomain-like
CDDcd001671.27E-5768809No hitNo description
SuperFamilySSF466892.48E-1012871337IPR009057Homeodomain-like
PROSITE profilePS512938.54212881339IPR017884SANT domain
SMARTSM007177.1E-912891337IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.603.0E-712901334IPR009057Homeodomain-like
PfamPF002492.8E-812921333IPR001005SANT/Myb domain
CDDcd001671.96E-612921335No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Plant Ontology ? help Back to Top
PO Term PO Category PO Description
PO:0000006anatomyplant protoplast
PO:0025017anatomyplant spore
PO:0030003anatomyprotonema
PO:0030018anatomygametophore
Sequence ? help Back to Top
Protein Sequence    Length: 2211 aa     Download sequence    Send to blast
MVECDGVEQE VVGKVCERVA AEAEAGGWIS KPEGVVADDV EKRGNMRAVT ECGLEESVRQ  60
GECCEALGLK ESMECQASPK VRGSVEHCGG VEVGGEVGVS DAGSRWAKED ILLRVEKVEY  120
EIEEVERELA KAEKVGSDRR AAGEWVAGVV GDEGSAGVDG VVKVEDGIDD GEAMDVDAAE  180
VHPGRVDSRE SGEAMQVACS GRNWSGGEEC GNGRWGSDGA GAAGNVGERV GGGVSEGGHG  240
GSVDDGGVLE EGELGEEKQE EWTSKQMSVG LESEGNPAYS PRAETEPAQR EEMDVTAGVS  300
KCGVAEKEEA ERKVISWDVE VVARSLMEEN KKRAEQARET FVHLLREGVG VEGTLYRCPA  360
EAGVWKENVE RHYRNQERML EKMGERRQSL RFAEQVLAMR FRALKEAWKQ EQVGMRQQQR  420
GTKPVRRWEV EKRNGTALHC HRSSLRLRPV QAGMEKVEAV SEECMKKVMA KAVVGPVRGV  480
LKMPSMIVGQ ENRLARRFES KNALVEDPVG MERERKSMNP WSWEEKRVFL EKFAVYNKNF  540
SKIASHLELK TTADCVEFYY RNQKSEDFER IRRRQQLKKR RDYSRVGGSF LSTGLQTSSQ  600
RREANGHGRT EGANVQTVGA VVGVSHISVG TKAARSSMQQ KPVERQRVSS ALEPGSLPGA  660
VEIGKGVSGK ENKWCGTGGV SGSAAGRGGI FGMVLSGATV SCGLSSAVAG AVKIGRERSS  720
VKTMVDAGLV VARCGQYEPN CFGAKGTRSI HPPLGLENFA KEEGDAQWTD RERELFTEGV  780
RLFGKDFERI AVHVGSTKSV GQCKAFFCKT RKRLGLDKLV EKYEDSLKVR YGGVMAESLE  840
CADVGRGEAA GMVAEDLRTS SLFECSCPGM EVESHGDKGD ENEKSVEAVV PVDVQEMEAV  900
EVEAVEGVRV EGVVVEVQSV ESEVVQDVAI ADDALVNGDV EPRGIEGFVA EDDASQELMS  960
KDEAIEKSVE DTGFEIAVVV ASAADADVDK ASSNASVEVT AFNEALRDET VEDAAVEQVV  1020
VSECLDAAAG ILENENLAVE GSIVLEQVGT DGHPDEGRGV GGVAVETSNG VCEEGNGDAV  1080
GATSVVVISK DVLLGSFVCL KEDTAIVDSA CPADREPVAS TPAVDASGAE DPVCVRTDAV  1140
NVPNDASVEA VKVDAAEDVF TGNKSTDLAA GCTIREGDPA DVVKVAESGL ESSKCVAAVG  1200
VDKAVFPVTK MSTQVADEPE AKASPKVDVK SEPGSPQVAT SVSTDSASFT SAAAAVSHSG  1260
EPVFSTTSSM QGREKVGRVG GGETKSRREP TSWTQEEKEK FADIIRNHGK DWTRLHECLP  1320
SKSLTQIKTY FQNSKAKLGL LNAEGVNVPG GRVAGSRKRK VDEAENGSNN VSCLSSGNEL  1380
KGGGVSASDM DGVCQNVKAA VGGVPSMNPS MGMGSGPSGL ESMLYPFLGQ RVEDQIALQN  1440
FVRMYSANGL AQNIAGGVNP FVQQYGFPMF PNAGHQRASQ LSLQQQLAAS LAQKSGQAKG  1500
LQQQELQGFG SVQQSGQASV QQKVAHQLQS AQMVRNQQLL ASVMQHQAAA HHQQNQTSKS  1560
VPHPLQPQVV VPQPGVAGQR QQHPQAQQGG SQQEQGSPGH PIGNPSLGVG NPTQASTQTK  1620
QLQQPQVSLH QQQTLQQLQH QQHQFLLQQH DHDHVRAQSH PQSHFHIQQP QNSHASIVQP  1680
LIVQKPKGVV PLQPQVTKPP LGTVAHQRCM PYRHDQQQLG PALSLLSGNG DSGLSNIRGE  1740
ISNQHGEIGD HRSCISNGGA SKGCDFSRAD FHQLSASVQR VKPSHAAIPH RPSPTPAAVS  1800
AQGRPGDVKL FGQSLLSQPT SCAVSQSAAR ELVSAADRGS LQQSPASTAA SSSLTSMPVS  1860
AASATKSHGK ESAFRGVSFM PDGLQGGRSS RGNQGSLELW NNMSDVRSQA GPVSKSIDGD  1920
SDIRSMQECR MSREAQDVES DPSTLKLTHK LQHLDQARGV QEHSGSEGFS PVVHGVGKAR  1980
ACANEHGVGV GFAVSCGEVS RTDPERRGES IGIDLGCSER SSIMASRSNR GQVESFAMQA  2040
AGLLANGPQV HRNLIDYLMA ITELHQTRSL SSHSSVGQPR LEDHQWESLT RHPTNGTAVD  2100
ALKVNPTLGL GSPIAAPQTH LSGNDLFQQC VRDSSMYFSQ HYPSLAGHGV SSSAWNGGAG  2160
LVHPSEMQRV LVNPPLSSFL PGVFSRAPVT EDHLGSTETR DPERGGGGIC *
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C1e-15475569194NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D1e-15475569194NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Ppa.109160.0protonema
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_001769009.10.0predicted protein
TrEMBLA9SRK80.0A9SRK8_PHYPA; Predicted protein
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.12e-34MYB family protein
Publications ? help Back to Top
  1. Rensing SA, et al.
    The Physcomitrella genome reveals evolutionary insights into the conquest of land by plants.
    Science, 2008. 319(5859): p. 64-9
    [PMID:18079367]